
Conversation

@tinque (Contributor) commented Oct 6, 2025

Implements #7809 and ports #7822 into the @langchain/aws library

changeset-bot commented Oct 6, 2025

🦋 Changeset detected

Latest commit: 3181380

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 1 package:
  • @langchain/aws (Minor)


vercel bot commented Oct 6, 2025

@tinque is attempting to deploy a commit to the LangChain Team on Vercel.

A member of the Team first needs to authorize it.

vercel bot commented Oct 6, 2025

1 Skipped Deployment: langchainjs-docs (Ignored, Oct 6, 2025 4:54pm UTC)

@tinque force-pushed the application-inference-profile branch from 5982b00 to 2f5c28d on October 6, 2025 16:54
@christian-bromann (Member) commented:

Really appreciate you taking the time to open this PR, @tinque 🙏
The team is currently focused on shipping our v1 release, so we’re pausing detailed reviews for a bit. We’ll follow up within the next 2–4 weeks once things settle — thanks so much for your patience!

@tinque (Contributor, Author) commented Oct 9, 2025

I totally understand @christian-bromann that the team is focused on the v1 release — congrats on that milestone! 🎉

That said, this change is quite minor and already covered by unit tests.
Would it be possible to get a quick review or exception on this one? It’s currently blocking our production deployment on our side, so even a short-term workaround or early merge would be extremely helpful.

Thanks again for your time and all the great work you’re doing with LangChain!

@tinque force-pushed the application-inference-profile branch from 2f5c28d to 40befc0 on October 16, 2025 08:56
@tinque force-pushed the application-inference-profile branch from 16d3417 to 2ed02b2 on October 16, 2025 11:26
@christian-bromann (Member) left a comment

Finally been able to take a look at this. What do you think about just extending the documentation for model to hint to users that they can also pass an Application Inference Profile ARN? Two concerns I have with this approach:

  • maintaining an additional field that represents the model in some cases
  • can we guarantee that the model behind the profile is the same as the one specified in model? Should we check for that?

Thoughts?
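
For reference, the documentation-only alternative would look roughly like this; a minimal sketch, assuming the profile ARN is simply passed as model (the ARN and region are placeholders):

```typescript
import { ChatBedrockConverse } from "@langchain/aws";

// Documentation-only alternative: pass the Application Inference Profile ARN
// directly as `model`. Bedrock's Converse API accepts a profile ARN as the
// modelId, but tracing metadata (e.g. in LangSmith) then only sees the opaque
// ARN instead of the underlying model name.
const llm = new ChatBedrockConverse({
  model:
    "arn:aws:bedrock:us-east-1:123456789012:application-inference-profile/abc123def456", // placeholder ARN
  region: "us-east-1",
});

const response = await llm.invoke("Hello!");
console.log(response.content);
```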

@tinque (Contributor, Author) commented Oct 29, 2025

Thanks Christian! 👋

Good points — here’s some context on that:

About maintaining an additional model field
That’s indeed the tricky part. If we only pass the inference profile ARN, inference works fine, but we lose the actual model name in the metadata.
That means we can’t properly track cost or latency per model in LangSmith, for example, which is why keeping an explicit model field is useful.

About guaranteeing that the profile and model match
Unfortunately, there’s currently no API to “describe” an inference profile and retrieve the underlying model.
The profile ID is opaque, so we can’t programmatically validate that it matches the model field.

That’s why, for now, the safest approach is to let users explicitly define both — model (for metadata/tracking) and inferenceProfileArn (for execution).
Once AWS exposes an API to resolve profiles, we could definitely add a validation layer.

For context, this implementation follows the same approach as #7822, which was based on the discussion in #7809.
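
Here is a minimal sketch of that two-field approach, assuming the applicationInferenceProfile option name used in this PR's implementation (see the review summary below); the model id and ARN are placeholders:

```typescript
import { ChatBedrockConverse } from "@langchain/aws";

// `model` keeps the real model name so LangSmith can attribute cost/latency
// correctly, while the profile ARN is what is actually sent as the modelId
// at execution time. Values below are placeholders.
const llm = new ChatBedrockConverse({
  model: "anthropic.claude-3-5-sonnet-20240620-v1:0", // metadata/tracking
  applicationInferenceProfile:
    "arn:aws:bedrock:us-east-1:123456789012:application-inference-profile/abc123def456", // execution
  region: "us-east-1",
});
```

Since nothing can currently validate that the profile routes to the declared model, keeping the two values in sync stays the caller's responsibility.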

@tinque (Contributor, Author) commented Nov 4, 2025

Hi 👋
Just a friendly reminder about this PR — is there anything I can do to help move it forward?
It’s aligned with the approach discussed in #7822 and #7809.
Thanks a lot for your time!

@chabli commented Nov 6, 2025

I'm also interested. Thank you!

@callmeGillou commented:

Hey there,
Any news on this topic? We'd love to implement it ASAP, as we haven't found any workaround with LangSmith.
I'd appreciate your support. Thanks!

@chabli commented Nov 6, 2025

It's a must-have to use LangSmith!

@christian-bromann (Member) left a comment

@tinque thanks for pinging. I think your comments make sense. I am ok moving forward as is. One request though: can we add a section to the README.md on using application inference profiles and the implications we discussed in this thread?

@hntrl any concerns?

Copilot AI review requested due to automatic review settings on November 10, 2025 18:35
@tinque (Contributor, Author) commented Nov 10, 2025

Hey @christian-bromann !

Added documentation for the inference profile feature in the README. It covers the usage and explains why we need both parameters for proper metadata tracking.

Check out the latest commit and let me know if you'd like any changes! 👍

Copilot AI left a comment

Pull Request Overview

This PR adds support for AWS Bedrock Application Inference Profiles to the ChatBedrockConverse class, allowing users to route inference requests through custom endpoints that can manage cross-region traffic.

  • Adds applicationInferenceProfile parameter to override the model ID in API calls while preserving metadata tracking
  • Updates both streaming and non-streaming code paths to use the inference profile ARN when provided
  • Includes comprehensive test coverage for the new functionality

Reviewed Changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 1 comment.

  • libs/providers/langchain-aws/src/chat_models.ts: Adds the applicationInferenceProfile property and logic to use it as the modelId in ConverseCommand and ConverseStreamCommand when provided
  • libs/providers/langchain-aws/src/tests/chat_models.test.ts: Adds a comprehensive test suite covering initialization and command creation with and without an inference profile, for both streaming and non-streaming modes
  • libs/providers/langchain-aws/README.md: Documents the new Application Inference Profiles feature with usage examples and important notes about model metadata tracking
  • .changeset/wet-taxis-heal.md: Adds a changeset entry marking this as a minor version feature addition
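
To make the reviewed behavior concrete, here is a hedged usage sketch of the streaming path (option name per this PR; the model id and ARN are placeholders):

```typescript
import { ChatBedrockConverse } from "@langchain/aws";

// When applicationInferenceProfile is set, it is used as the modelId for both
// ConverseCommand (invoke) and ConverseStreamCommand (stream), while `model`
// is preserved for metadata tracking. Values below are placeholders.
const llm = new ChatBedrockConverse({
  model: "anthropic.claude-3-5-sonnet-20240620-v1:0",
  applicationInferenceProfile:
    "arn:aws:bedrock:us-east-1:123456789012:application-inference-profile/abc123def456",
  region: "us-east-1",
});

const stream = await llm.stream("Hello!");
for await (const chunk of stream) {
  console.log(chunk.content);
}
```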


@christian-bromann (Member) left a comment

LGTM 👍

@hntrl thoughts?

@tinque (Contributor, Author) commented Nov 17, 2025

Hi @christian-bromann @hntrl
Just a friendly reminder about this PR — is there anything I can do to help move it forward?
